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SMALL MOLECULE MIMETICS OF ERYTHROPOIETIN 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

This invention relates to computer-assisted methods for identifying and designing small 
molecule mimetics of erythropoietin. 

2. Description of Related Art 

Erythropoietin (EPO) is the primary regulator of the proliferation and differentiation of 
immature erythroid cells. EPO is produced in the fetal liver and in the adult kidney in 
response to hypoxia (low oxygen levels in blood or tissue). It circulates in the blood 
stream where it targets the EPO receptor (EPOR) on committed progenitor cells in the 
bone marrow and other hematopoietic tissues. Recombinant human erythropoietin 
(rHuEPO) is widely used in therapy of patients with anaemia due to chronic renal failure, 
cancer chemotherapy and AZT treatment. 

The EPO receptor belongs to the cytokine receptor superfamily which includes receptors 
for other hematopoietic growth factors such as interleukins (ILs), colony stimulating 
factors (CSFs) as well as growth hormone prolactin and ciliary neurotrophic factor 
(CNTF). The structural architecture of this family of receptors consists of three modules: 
a ligand binding extracellular domain, a short trans membrane region and a large 
cytoplasmic domain. It has been proposed that the extracellular domain of this 
superfamily comprises two discrete domains each containing approximately 100 residues 
that fold into a sandwich consisting of 7 antiparallel P-strands with the topology of an Ig 
constant domain. Members of the family share two characteristic motifs in their 
extracellular domain: a pair of conserved disulfide bridges in the N-terminal domain, and 
a WSXWS box (where X is any amino arid residue) in the C-terminal domain. For most 
members of this receptor superfamily, oligomerization of one or more polypeptide chains 
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is essential for forming high affinity receptor complexes. A homodimer complex has been 
demonstrated to be the active form of hGHR and a similar model has been suggested for 
G-CSF, prolactin and EPO receptors. 

Erythropoietin induces dimerization of two EPO receptor molecules, which results in 
subsequent phosphorylation of the cytoplasmic domains by the association with two 
tyrosine kinase (JAK2) molecules to initiate a cascade of events that leads to the relevant 
biological. 

Given the importance of erythropoietin, it would be very desirable to be able to identify 
molecules capable of binding the EPO receptor and eliciting the response normally 
elicited by EPO. 
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SUMMARY OF THE INVENTION 

The invention features methods for identifying molecules which will bind to the EPO 
receptor and act as a EPO mimetic. Preferred EPO mimetics identified using the method 
of the invention act as agonists of the EPO receptor in one or more in vitro or in vivo 
biological assays of EPO activity. Preferred mimetics are molecules lacking peptide 
bonds, i.e., are non peptidic mimetics. Preferred peptide mimetics have 15 or fewer, more 
preferably 10 or fewer amino acids. 

The methods of the invention entail identification and design of molecules having a 
particular structure. The methods rely on the use of precise structural information derived 
from x-ray crystallographic studies of the extracellular domain of EPO receptor (amino 
acids 1 to 225) complexed with a peptide, EMP1 (EPO Mimetic Peptide 1; described 
below), which acts as an EPO mimetic. This crystallographic data permits the identifica- 
tion of atoms in the peptide mimetic that are important for EPO receptor binding and 
dimerization. More importantly, this data defines a three dimensional array of the 
important contact atoms. Other molecules which include a portion in which the atoms 
have a similar three dimensional arrangement similar to some or all of these contact 
atoms are likely to be capable of acting as an EPO mimetic. Moreover, one can use the 
structural information to design or identify molecules having even more EPO activity 
than the peptide mimetic described herein. 

The details of the preferred embodiment of the present invention are set forth in the 
accompanying drawings and the description below. Once the details of the invention are 
known, numerous additional innovations and changes will become obvious to one skilled 
in the art. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a flowchart showing a first method for identifying potential mimetics of 
erythropoietin using a computer system. 

FIGURE 2 is a flowchart showing a second method for identifying potential mimetics of 
erythropoietin using a computer system. 



reference numbers and designations in the various drawings indicate like elements. 
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DETAILED DESCRIPTION OF THE INVENTION 

Throughout this description, the preferred embodiment and examples shown should be 
considered as exemplars, rather than as limitations on the present invention. 

Described below is the crystal structure of a small peptide mimetic of EPO bound to an 
extracellular portion of the EPO receptor. The peptide, EMP1 
(GGTYSCHFGPLTWVCKPQGG; SEQ ID NO; 1), is characterized by an intramolecular 
disulfide bridge. Several lines of evidence suggest that EMP1 can act as an EPO mimetic. 
For example, EMP1 competes with EPO in receptor binding assays and induces cellular 
proliferation of cell lines engineered to be responsive to EPO. Both EPO and peptide 
induce a similar cascade of phosphorylation events and cell cycle progression in EPO 
responsive cells. Furhter, EMP1 demonstrates significant erythropoietic effects in mice 
as monitored by two different in vivo assays of nascent red blood cell production. This 
data, when combined, strongly supports the notion that the peptide ligand, which has a 
sequence unrelated to that of EPO, is capable of binding to and inducing an agonist 
conformation or assembly of EPO receptor. 

Design of small molecule mimetics 

The structure of the EMPi aimer demonstrates that a molecule substantially smaller than 
the natural hormone can act as an agonist and induce the appropriate biological response. 
The peptide is assumed to have a substantially smaller contact interface with the receptor 
than its natural hormone. The binding determinants in the EPO receptor form an almost 
flat surface which is mainly hydrophobic in nature, without any cavities or charged 
residues that may help in design of a small molecule ligand to interact with the receptor. 

This simplified framework of interactions revealed by the structural data presented herein 
can be used to identify additional EPO mimetics. The atoms of EMPI which are 
important for binding to the EPO receptor and forming dimeric EPO receptor include 
those involved in the contact between the EMPI (peptide) and EBP (EPO receptor) and 



WO 97/41526 



PCT/US97/07218 



those involved in contacts between the two EMPl molecules in the dimeric complex 
(peptide-peptide contacts). In addition to the contacts listed in Table 2, the following 
EMP1-EMP1 hydrophobic contacts are significant: Tyr P4 , Cys", Phe M , Trp M3 , and Cys W5 
in each peptide. The following EMPl -EBP hydrophobic interactions are also significant. 
Tyr", Phe w , and Trp m in each peptide. It will be understood by those skilled in the art 
that not all of the atoms present in a significant contact residue need be present in a 
mimetic. In fact, it is only those few atoms which actually from important contacts with 
the EPO receptor which are likely to be important for mimetic activity. Those skilled in 
the art will be able to identify these important atoms based on the model of the dimeric 
EMPl -EPO complex which can be constructed using the structural data herein. 



Preferred mimetics will include atoms at postions similar to those of the EPO receptor 
contacting atoms of EMPl . Even more preferred mimetics will be structurally similar to 
the dimer of EMPl found in the structure described below. This is because the 
dimerization of EMPl is an important factor in the diemerization of the EPO receptor. 

The methods of the invention employ a computer-based methods for identifying 
compounds having a desired structure. More specifically, the invention uses the three- 
dimensional coordinates of a subset of the atoms in the peptide 
GGTYSCHFGPLTWVCKPQGG when the peptide is co-crystallized with a portion of 
the erythropoietin receptor comprising amino acids 1 to 225 of the human EPO receptor, 
to determine peptide and non-peptide mimetic candidates by means of computer methods. 

These computer-based methods fall into two broad classes: database methods and de novo 
design methods. In database methods the compound of interest is compared to all 
compounds present in a database of chemical structures and compounds whose structure 
is in some way similar to the compound of interest are identified. The structures in the 
database are based on either experimental data, generated by NMR or x-ray crystallogra- 
phy, or modeled three-dimensional structures based on two-dimensional (i.e., sequence) 
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data. In de novo design methods, models of compounds whose structure is in some way 
similar to the compound of interest are generated by a computer program using 
information derived from known structures, e.g., data generated by x-ray crystallography 
and/or theoretical rules. Such design methods can build a compound having a desired 
structure in either an atom-by-atom manner or by assembling stored small molecular 
fragments. 

The success of both database and de novo methods in identifying compounds with 
activities similar to the compound of interest depends on the identification of the 
functionally relevant portion of the compound of interest. For drugs, the functionally 
relevant portion is referred to a pharmacophore. A pharmacophore then is an arrangement 
of structural features and functional groups important for biological activity, e.g., EPO 
activity. 

Not all identified compounds having the desired pharmacophore will act as an EPO 
mimetic. The actual activity can be finally determined only by measuring the activity of 
the compound in relevant biological assays. However, the methods of the invention are 
extremely valuable because they can be used to greatly reduce the number of compounds 
which must be tested to identify an actual mimetic. 

Dimerization of the EPO receptor is important for activity. Accordingly, preferred 
mimetics will be based on the structure of the EMP1 dimer as it is bound to the EPO 
receptor dimer. Thus, preferred mimetics have include important contacts from both of 
the RWJ 61233 peptides present in the structure described below. Such mimetics will 
favor dimerization of the EPO receptor. 

Programs suitable for generating predicted three-dimensional structures from two- 
dimensional data include: Concord (Tripos Associated, St. Louis, MO), 3-D Builder 
(Chemical Design Ltd,, Oxford, U.K.), Catalyst (Bio-CAD Corp., Mountain View, CA), 
and Daylight (Abbott Laboratories, Abbott Park, IL). 
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Programs suitable for searching three-dimensional databases to identify molecules 
bearing a desired pharmacophore include: MACCS-3D and ISIS/3D (Molecular Design 
Ltd., San Leandro, CA), ChemDBS-3D (Chemical Design Ltd., Oxford, U.K.), and 
Sybyl/3DB Unity (Tripos Associates, St. Louis, MO). 

Programs suitable for pharmacophore selection and design include: DISCO (Abbott 
Laboratories, Abbott Park, IL), Catalyst (Bio-CAD Corp., Mountain View, CA), and 
ChemDBS-3D (Chemical Design Ltd., Oxford, UK). 

Databases of chemical structures are available from Cambridge Crystallographic Data 
Centre (Cambridge, U.K.) and Chemical Abstracts Service (Columbus, OH). 

De novo design programs include Ludi (Biosym Technologies Inc., San Diego, CA) and 
Aladdin (Daylight Chemical Information Systems, Irvine CA). 

Those skilled in the art will recognize that the design of a mimetic may require slight 
structural alteration or adjustment of a chemical structure designed or identified using the 
methods of the invention. 

In general, chemical compounds identified or designed using the methods of the invention 
can be sythesized chemically and then tested for EPO activity using any of the methods 
described below. The methods of the invention are particularly useful because they can 
be used to greatly decrease the number potential mimetics which must be screened for 
EPO activity. 

The invention may be implemented in hardware or software, or a combination of both. 
However, preferably, the invention is implemented in computer programs executing on 
programmable computers each comprising a processor, a data storage system (including 
volatile and non-volatile memory and/or storage elements), at least one input device, and 
at least one output device. Program code is applied to input data to perform the functions 
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described above and generate output information. The output information is applied to 
one or more output devices, in known fashion. The computer may be, for example, a 
personal computer, microcomputer, or workstation of conventional design. 

Each program is preferably implemented in a high level procedural or object oriented 
programming language to communicate with a computer system. However, the programs 
can be implemented in assembly or machine language, if desired. In any case, the 
language may be a compiled or interpreted language. 

Each such computer program is preferably stored on a storage media or device (e.g., 
ROM or magnetic diskette) readable by a general or special purpose programmable 
computer, for configuring and operating the computer when the storage media or device 
is read by the computer to perform the procedures described herein. The inventive system 
may also be considered to be implemented as a computer-readable storage medium, 
configured with a computer program, where the storage medium so configured causes a 
computer to operate in a specific and predefined manner to perform the functions 
described herein. 

FIGURE 1 is a flowchart showing a first method for identifying potential mimetics of 
erythropoietin using a computer system. The method uses a programmed computer 
comprising a processor, a data storage system, at least one input device, and at least one 
output device, and comprises the steps of: 

(1) inputting into the programmed computer through an input device data comprising 
the three-dimensional coordinates of a subset of the atoms in the peptide 
GGTYSCHFGPLTWVCKPQGG when the peptide is co-crystallized with a 
portion of the erythropoietin receptor comprising amino adds 1 to 225 of the 
receptor, thereby generating a criteria data set (STEP 100); 

(2) comparing, using the processor, the criteria data set to a computer database of 
chemical structures stored in the computer data storage system (STEP 102); 
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(3) selecting from the database, using a program suitable for searching three- 
dimensional databases to identify molecules bearing a desired pharmacophore 
(such as those described above or equivalents), chemical structures having a 
portion that is structurally similar to the criteria data set (STEP 104); 

(4) outputting to an output device the selected chemical structures having a portion 
similar to the criteria data set (STEP 106). 

FIGURE 2 is a flowchart showing a second method for identifying potential mimetics of 
erythropoietin using a computer system. The method uses a programmed computer 
comprising a processor, a data storage system, at least one input device, and at least one 
output device, and comprises the steps of: 

( 1 ) inputting into the programmed computer through an input device data comprising 
the three-dimensional coordinates of a subset of the atoms in the peptide 
GGTYSCHFGPLTWVCKPQGG when the peptide is co-crystallized with a 
portion of the erythropoietin receptor comprising amino acids 1 to 225 of the 
receptor, thereby generating a criteria data set (STEP 200); 

(2) constructing, using a program suitable for generating chemical structure models 
(such as those described above or equivalents), a model of a chemical structure 
having a portion that is structurally similar to the criteria data set (STEP 202); 

(3) outputting to the output device the constructed model (STEP 204). 

Confirmation of Biological Activity 

In order to determine whether a molecule identified using the methods of the 
invention can act as an EPO mimetic, one or more in vitro or in vivo assays of EPO 
activity should be performed. For example, mimetic molecules should be able to 
stimulate proliferation of TF-1 cells (Kitamura et al., J. Cell Physiol 140:323, 1985) or 
B6Sut cells (Greenberger et al., Proc. Natl. Acad. Sci. USA 80:2931, 1983), but 
preferably do not stimulate proliferation of cells which do not bear the EPO receptor. 
Thus, preferred mimetics do not stimulate proliferation of Mo7e cells (Avanzi et al., Br. 
J. Haematol. 69:359, 1988). 
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Potential mimetics can also be tested in a murine model of erythropoiesis. In this assay 
a potential mimetic is administered to normal mice which express endogenous basal 
levels of EPO. Reticulocytes are counted, preferably by flow cytometry, to determine 
whether the candidate mimetic increases reticulocyte levels. An increase in reticulocyte 
levels indicates that the candidate mimetic is stimulating efythropoiesis. Because the 
mice used in this assay already express EPO, this assay may be relatively insensitive. As 
an alternative, candidate mimetics can be assayed in the exhypoxic-polycythemic mouse 
bioassay. In this assay polycythemia is induced by conditioning mice in a hypobaric 
chamber to reduce endogenous EPO levels. A potential EPO mimetic can be adminis- 
tered to a conditioned mouse. Incorporation of "Fe into blood serves as a measure of 
erythropoiesis. This erythropoiesis can be attributed to the candidate mimetic. 

The assays described above are examples of suitable assays. Other assays for EPO 
activity known to those skilled in the art are also useful. 

In order to determine the biological activity of a candidate mimetic it is preferable to 
measure biological activity at several concentrations of candidate mimetic. The activity 
at a given concentration of candidate mimetic can be compared to the activity of EPO 
itself. 

Structural Data 

The coordinates for amino acids 1 to 225 of the human EPO receptor bound to peptide 
EMP1 are presented in the attached appendix in standard Brookhaven database format. 
Also included in this appendix is a list of van der Waals interactions. These coordinates 
can be used in the design and identification of EPO mimetics according to the methods 
of the invention. 

Structure of EBP-EMP1 Complex 

The extracellular fragment of human EPO receptor (EPO binding protein, EBP), 
consisting of residues 1-225, was expressed in Escherichia co» and purified as described 
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(Johnson et al., Protein Express. Purif. 7:104, 1996). Rhomboidal-shaped crystals of an 
EBP complex with EMP1 were obtained in orthorhombic space group P2,2 1 2„ with cell 
parameters a=59.2A, b=75.5A, c=132.2A, with two EBP and two peptide molecules in 
the asymmetric unit and a V M =2.8 AVdalton (Matthews, J. Mol. Biol. 33:491, 1968). The 
crystal structure was determined by multiple isomorphous replacement (MIR) using two 
heavy atom derivatives (Table 1). Residues 1-2 and 19-20 of each peptide as weU as 
residues 1-9, 21-23, 164-166, 221-225 of receptor molecule I, and residues 1-9, 21-23, 
133-135, 221-225 of receptor molecule n had poor or no electron density and are 
excluded from the structure analyses 

An important break in the electron density that affects the structure interpretation occurs 
for the three residues (Arg" -Gly 23 -Pro") that link the amino terminal o-helix to the first 
P-strand in Dl of both receptor molecules. A molecular packing diagram shows the 
proximity of a second non-crystallographically related dimer in the crystal that gives two 
possibilities of how this three-residue linker may be connected. The current choice of 
linker connectivity is based on a structure of another independent EBP-peptide complex 
at higher resolution (2.5 A), which shares a similar molecular packing, but for which the 
electron density is clear for these three residues. At present there are no experimental 
data to verify whether this N-terminal a-helix exists in solution or is a crystallization 
packing artifact. Notably, this helical region is not observed in the published structures 
of hGHbp (begins at residue 32; deVos et al., Science 255:306, 1992), PRLR (begins at 
residue 2, without any defined secondary structure until the first P-strand, residue 6; 
Somers et al., Nature 372:478, 1994), the INF-yRa (begins at residue 17; Walter et al., 
Nature 376:230, 1995) or the tissue factor (begins at residue 3 without any defined 
secondary structure until the first p-strand, residue 11; Muller et al., Nature 370:662, 
1994). 

The EBP monomer folds into two domains, Dl and D2, that form an L-shape with the 
long axis of each domain aligned at approximately 90° to each other; the overall 
molecular dimensions are 45 A x 52 A x 62 A. The N-terminal domain (Dl, residues 10- 



WO 97/41526 



PCT/US97/07218 



-13- 

1 14) and C-terminal domain (D2, residues 1 19-220) are connected by a short four residue 
a-helix linker. Both domains are more closely related in overall topology to Fibronectin 
type-m (FBN-IH) domains than to Ig domains (Bork et al., J. Mol. Biol., 242:309, 1994). 
The FBN-lll fold is composed of two antiparallel P-pleated sheets, consisting of strands 
A, B, E and strands G, F, C and C\ and is found in the two domains of the human growth 
hormone (de Vos et al., Science 255:306, 1992) and prolactin (Somers et al., Nature 
372:478, 1994) receptors, theDl and D2 domains of the o chain of interferon-y receptor 
(IFN-yRa) (Walter et al., Nature 376:230, 1995), the D2 domain of CD4 (Wang et al, 
Nature 348:41 1, 1990; Ryu et al., Nature 348:419, 1990), the two domains of tissue factor 
(MuUer et al., Biochemistry 33:10864, 1994; Harlos et al., Nature 370:662, 1994), the 
third fibronectin-type repeat of tenacin (Leagy et al., Science 258:987, 1992) and the D2 
domain of the chaperone protein PapD (Holmgren et al., Nature 342:248, 1989). The 
FBN-in topology differs from an Ig constant domain by a shift of strand D from one P- 
sheet (strands A, B, E and D) to the other (strands G, F, C, C), where it is defined as the 
C strand. Superposition of equivalent p-sheet core residues of the Dl and D2 domains 
in EBP gives an r.m.s. deviation of 2.3 A for 77 Ca pairs, which is significantly larger 
than the corresponding domain overlaps for hGHbp (1 . 1 A) and PRLR (0.8A), and reflects 
a difference in the subclass of fold between the two EBP domains. 

In Dl, a short a-helix (residues 10-20), precedes the first p-sandwieh that is better 
described as a hybrid of the FBN-ffl fold with an Ig fold (residues 24-1 14), rather than 
strict FBN-lll topology. In this h-type fold (Wang et al., Nature 348:41 1, 1990; Ryu et 
aL, Nature 348:419, 1990), the C strand is long and interacts first with strand C and then 
switches to interact with strand E (where C changes its designation to strand D) forming 
a four-on-four strand p-sandwich. Dl contains the two conserved disulfide bridges 
linking Cys " (PA) to Cys " (pB) and Cys" (pC) to Cys" (pE). The number of residues 
between the cysteine pairs that form the two disulfide bridges are 9 and 15 for EBP, 
compared to 9 and 10 in both GHR and PRLR. The longer connection between strands 
C and E enables the second half of strand C to become strand D. This h-type topology 
is not found in either of the two s-type GHR domains. A potential glycoylation site exists 
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on residue Asn 52 which is located towards the end of the loop region connecting the pB 
and PC strands. Although Asn 52 is not glycosylated in this bacterially expressed protein, 
an external cavity around the Asn 52 side chain could easily accommodate a carbohydrate 
moiety. 

A helical linker (residues 1 15-1 18) connects Dl to D2 (The 4>, ¥ torsion angles for the 
interdomain helical linker for lie'", Asn"*, Glu" 7 and Val u are -50° -27°, -76°, -21 °, - 
99°, 26°, and -15 1 °, 38° respectively.) and has been observed in other members of this 
receptor femily, hGHbp, PRLR, IFN-yRo and tissue factor. In EBP, the domain 
association is further restricted by a mixed assortment of hydrogen bonding, hydrophobic 
interactions and one salt bridge (between Arg" and Asp' a ) from 1 1 residues of Dl and 
12 residues of D2 with a total buried surface [The molecular surface areas buried by 
interaction were calculated using the program MS (Connolly, J. Appl. Crystallog, 16:439, 
1983) using a 1.7A probe sphere and standard atomic radii (as described in Davies, et al, 
Ann. Rev. Biochem. 59:439, 1990). There may be some discrepancies between values 
reported here and other (deVos et al., Scince 255:306, 1992) published values due to use 
of a different algorithm (Connolly) vs. Lee et al., J. Mol. Biol., 55: 379, 1971) and probe 
radii. For clarity all values reported here have been calculated in the same way for better 
comparison between the receptors] of 950 A 2 for the two domains. 

D2 (residues 1 19-220) folds into the standard FBN-111 (s-type) topology with one free 
cysteine and no disulfide bridges, consistent with GHR and PRLR that have three and two 
disulfide bridges, respectively, in D 1 but none in D2. After the a-helix linker, D2 begins 
with an irregular coil (residues 1 18-126) that contains Pro 124 which is conserved in the 
structures of hGHbp, PRLR, tissue factor and IFNy-Ra, and based on sequence 
alignment, in most class- 1 and dass-2 cytokine receptors (Bazan, Proc. Natl. Acad. Sci. 
USA 87:6934, 1990). This short coil ends with Gly 124 which has a positive <p (<j>, Y * 
52°,40°) consistent with the equivalent Ala 13 * and Ala IM torsion angles in hGHbp (<p, ¥ 
» 63 \68°) and PRLR (<J>, Y » 58°,38°). The Pro 124 region forms an analogous extended 
bulge conformation adjacent and parallel to a corresponding bulge containing the 
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WSXWS motif The WSAWS sequence forms a modified wide P-buige (Richardson, 
Adv. Prot. Chem. 34:167, 1981) and is located in an extended chain region immediately 
preceding the pG strand that would normally connect to the membrane spanning region 
oftheEPOR. 

The quaternary structure of the complex is composed of two peptides and two receptors 
that form a T-shapes assembly. A noncovalent peptide dimer interacts with two receptor 
molecules to generate an almost perfect 2-fold symmetrical arrangement. After 
superposition of D2 of the two EBP molecules in the dimer, the centers of mass of the 
two Dl domains are only 0.8 A apart, sufficient to perturb perfect two-fold symmetry. 
Separate superposition of the corresponding Dl and D2 of each receptor in the dimer 
results in r.m.s. deviations of 0.53 A (105 Dl Ca pairs) and 0.47 A (93 D2 Ca pairs). 

The cyclic EMP1 contains a single disulfide bridge between Cys M and Cys™, which links 
two short p-strands (residues 4-7 and 13-16) that are connected by a slightly distorted 
type 1 P-tuni [Pro p, ° 0+1) and Leu" 1 0+2) of the p-turn have <j>,T « -62°, -38° and -99°, 
-60°, respectively. The carbonyl oxygen of Leu Pn has a hydrogen bond to EBP distorting 
the Y value from its normal 0°±30° (B-2) in a standard type I P-tum.] consisting of 
residues Gly^-Pro^-I^^-Thr* 12 . Each peptide has a very close association with its 
other peptide partner and buries 320 A of its 1220 A 2 molecular surface m this interaction 
(Connelly, J. Appl. Crystallog. 16:439, 1983; Davies et al., Ann. Rev. Biochem. 59:439, 
1990; Richards, J. Mol. Biol. 55:379, 1971). Four hydrogen bonds between the 
mainchains of the two peptides results in formation of a four-stranded anti-parallel P- 
pleated sheet (Table 2). Two symmetric hyrdophobic cores are assembled by peptide 
dimerization and are comprised of the disulfide bridges and the side chains of Tyr^Phe" 
and Trp pu . The construction of each hydrophobic core resembles a box which places the 
aromatic rings of Phe^Tip" 3 and Tyr* 4 (from the other peptide) and the disulfide bridge 
(Cys M -Cys P15 ) at the corners. The two glycine residues at either end of the peptide are 
not structured. 
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The peptide dimer is embedded in a deep crevice between two EBP receptor molecules. 
A portion of each peptide monomer interacts with both receptor molecules. The binding 
sites of each EBP are practically identical due to the 2-fold symmetric interactions 
imposed on binding the peptide dimer. The four major contact areas on EBP come from 
segments on four loop regions (LI, L3, L5, L6) that connect strands A to B (Ll residues 
33-34) and F to G (L6 residues 90-94) in Dl and strands B to C (L5 residues 148-153) 
and F to G (L6 residues 203-205) in D2. The total buried molecular surfaces in the 
peptide-EBP assembly are 840 A 2 and 880 A 2 for the two peptides and EBP«s, respec- 
tively. The peptide-EBP interaction can be separated into distinct hydrophobic (67%) and 
polar (33%) areas. A hydrophobic core is formed between the peptide and receptor and 
comprises Phe*,Met ,S0 and Phe 205 from one EBP molecule and the peptide hydrophobic 
box consisting of Phe M and Trp p " from one peptide and Tyr" and Cys" 5 from the other 
peptide. The polar interactions are located mainly at the bottom of the binding crevice 
and are mainly with loop L5 in D2. Five of the six hydrogen bonds are between the 
mainchain of the p-turn residues Gh/'.Pro™ and Led" from one peptide with the 
mainchain and sidechain hydroxyl of conserved Tyi* 4 , which crosses over hs other 
peptide partner, to interact with loop L3 (Table 2). The EBP-EBP interaction makes a 
surprisingly minor contribution to the overall stability of the complex where the inter- 
receptor buried molecular surface is only 75 A 2 , contributed by Leu 17 ' and Arg" 1 from 
each receptor moiecuie. 

EMP1 is one of a family of sequences that contain several conserved residues, besides the 
cysteines ^SBS^JmamWmmmm Spm^. The most structurally significant 
of these consensus residues appear to be Tyi* 4 and Trp p ", which along with the disulfide 
bridge have a major contribution to the hydrophobic core of the peptide-peptide 
interaction. Moreover, these two aromatic residues play a pivotal role in peptide-receptor 
interaction and in receptor dimerization. 



Dimerization of EBP in Solution 
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To explore the interaction of EMP1 with EBP in solution we employed a 
bifunctionalsulphydryl reactive crosslinker DPDPB, [ 1 ,4-di-(2*-pyridyldithio 
propionamido) butane], in an attempt to stabilize a peptide-dependent dimeric structure. 
The choice of crosslinker was based on previous experiments with amine-reactive 
crosslinkers that were found to inactivate EBP. EBP contains a single free sulphydryi 
(Cys lil ) in D2 which is potentially reactive to crosslinking reagents (The DPDPB 
crosslinker itself does not inactivate the EPO binding potential of EBP nor the 
proliferative properties of EMP1). A dimeric EBP product is formed by co-incubation 
of EMP1, DPDPB and EBP. The amount of dimeric product increases with peptide 
concentration and no significant dimer product is observed in the absence of peptide. 
DPDPB-crosslinked products formed through disulfide-exchange reactions should be 
readily reversible by reduction as is seen for the covalently-linked EMP1 -mediated dimer. 
Furthermore, we have constructed a covalently-linked dimeric form of EMP1 that 
demonstrates increased biological potency (Johnson et al, in preparation). The Cys 111 
residues in D2 of the EBP dimer are 20.7 A apart (Sy-Sy distance) which approximates 
the 16 A length (and approximately 2 A in bond length at each end) of the DPDPB cross- 
linker. Thus EMP1 mediates formation of a soluble EBP dimer complex in solution 
consistent with the crystal structure. 

The WSXWS motif 

The WSAWS sequence (residues 209-213) corresponding to the WSXWS box occurs in 
a P-bulge (Richardson, Adv. Prot Chem. 34:167, 1981; Chan et al., Protein Science, 
1:1574, 1993) immediately preceding P-strand G in D2. Residues in this motif do not 
interact with Bgand, have no role in receptor-receptor interactions and are located on the 
opposite side of the receptor-receptor and receptor-ligand interface. The WSAWS box 
represents only a segment of a complex array of interactions that involves several other 
conserved side chains from the four-stranded P-sheet in D2. The indole ring systems of 
Tip 209 and Trp 212 point toward an external concave surface of the P-sheet and are only 
partially solvent exposed, whereas the Ala 211 side chain points directly out into solution. 
The amides and hydroxyls of both Ser 210 and Sei 213 form hydrogen bonds with the main 
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chain of residues 198 and 196 of adjacent strand F in a pseudo P-sheet type interaction 
that resembles a modified wide p-bulge (Richardson, Adv. Prot. Chem. 34:167, 1981; 
Chan et al., Protein Science, 2:1574, 1993) where the sidechain hydroxyl rather than the 
carbonyl oxygen makes the P-sheet interaction. The p-bulge architecture places the two 
Trp residues, which are spread four residues apart, on the same side of the P-sheet and 
not on opposite sides as in normal p-sheet or extended chain structures. The guanidinum 
group of Arg 1 * 7 from Strand F, the central residue (Richardson, Adv. Prot. Chem. 34: 167, 
1981; Chan et al., Protein Science, 2:1574, 1993) in the bulge, is positioned exactly 
between the two Trp indole rings to form an extended Jt-cation system (Kumpf et al., 
Science 261:1708, 1993. Tht center of the pyrrole ring of Trp **, the Nc of the Arg w and 
the center of the benzene ring of Trp 2n are positioned on a straight line with the three 
planes of the conjugated systems stacked parallel to each other at approximately 4 A 
spacing. In addition, the aliphatic portion of the Arg 1 " side chain has hydrophobic 
interactions with the indole ring of Trp 209 , completing the alternating stacking of two 
aromatic and two positively-charged amino acid residues. The side chain of Glu ,S7 forms 
a hydrogen bond with Arg 1 * 7 presumably to help orient the guanidinium group and add 
some specificity and stabilization to the system. 

It appears then that the linear WSXWS motif identified from sequence alignments of 
cytokine receptors represents only a component of a more complex conformational unit 
that contributes a significant structural feature to D2. Aromatic residues have previously 
been suggested to have a stabilizing effect and play a role as a folding nuclei in structures 
of antiparaDel p-sandwiches (Finkelstein et al., Protein Eng. 6:367, 1993). The amino- 
aromatic parallel stacking between the guanidinium group of arginine and the aromatic 
rings is a common feature in protein structures (Burley et al., Adv. Prot. Chem., 39: 125, 
1988; Flocco et al., J. Mol. Biol., 235:709, 1994), but a parallel triple stacking of n-cation 
systems is rare (Kim et al., Biochemistry 32:8465, 1993) although observed in other 
ciass-1 cytokine receptors, hGHbp and PRLR. 
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The structural equivalents of the WSXWS motif in hGHbp (YGEFS) and PRLR 
(WSAWS) are involved in an even more intricate and complex array of n-cation 
interactions. The it-cation system is extended in hGHbp and PRLR to include an 
additional aromatic residue (Trp ,K for hGHbp and Trp ,S6 for PRLR) from the loop region 
s that links PC and pC in D2 and a positively-charged residue (Arg 211 for hGHbp and 
Arg 147 for PRLR) that stacks between the Trp and the second aromatic residue. The 
additional Arg residue is contributed either from the pF strand as in hGHbp (Arg 211 ) or 
from pC as in PRLR (Arg 147 ); the glutamine residue that hydrogen bonds and orients the 
arginine also switches strands. Sequence alignments suggest that this Arg-Gln switch 
10 could be common to other members of the class- 1 cytokine receptor family. The 
extended it-cation system in hGHbp and PRLR consists of five positively charged and 
three aromatic residues stacked in an alternating order which comprises of Lys 2 ' 5 , Tyr 222 , 
Arg 2 ' 3 , Phe 225 , Arg 211 , Trp 1 **, Lys ,w for hGHbp and Lys" 5 , Trp 1 ", Arg"\ Trp 194 , Arg 147 . 
Tip 1 *, Lys 14 * for PRLR. The first aromatic-Arg-aromatic trio are approximately 4A apart, 
is as in EBP, but the second system is stacked closer together at approximately 3.6 A 
spacings consistent with 7t-7t interaction (Burley et al., Adv. Prot. Chem., 39:125, 1988; 
Flocco et al., J. Mol. Biol., 235:709, 1994). The outer lysines also use the aliphatic 
portions of their side chains to form hydrophobic interactions with the aromatic rings. 
Based on sequence alignments with other members of the class- 1 cytokine receptor 
20 superfamily, such structurally extended rc-cation systems could exist in human 
thrombopietin, EL-6 and ciliary neurotrophic factor receptors, and in human IL-4 receptor 
based on structural modeling (Gustchina et al., Proteins 21:140, 1995). Although EFN- 
YRo and tissue factor do not have a WSXWS motif; the corresponding sequences TTEKS 
(residues 213-217) for TFN-yR* (Walter et al., Nature 376:230, 1995) and KSTDS 
25 (residues 201-205) for tissue factor (Muller et al., Biochemistry 33:10864, 1994; Harlos 
et al., Nature 370:662, 1994), maintain a very similar p-bulge. The consensus sequence 
among these five x-ray structures indicates that a serine or threonine in positions 2 and 
5 maintain a common set of hydrogen bonds between their side chain hydroxyls and the 
mainchain of the neighboring strand. Only in hGHbp is there no hydroxyl-containing 
30 residue in position 2, but Ser 226 still maintains the equivalent interaction. A Set 324 to Ala 
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mutation abrogates hGHR binding to hGH, and its expression on the cell surface is 
drastically reduced (Baumgartner et al., J. Biol. Chem., 269: 29094, 1994). In GM- 
CSFRo and E--2Rp, point mutations of the serine residues cause a substantial decrease 
in cell surface expression but little or no effect on ligand binding (Ronco et al., J. Biol. 
Chem. 269:277, 1994; Miyazaki et al., EMBO Journal 10:3191, 1991). 

Conservation of the WSXWS motif in EPOR or its equivalent in other members of the 
class 1 cytokine receptors has been proposed to be essential for biological activity and 
was thus assumed to be part of the receptor binding site (Yoshimura et al., J. Biol. Chem. 
267:11619, 1992; Quelle, Mol. Cell. Biol. 12:4553 1992). For EPOR, a systematic study 
of 100 mutations of the WSAWS sequence demonstrates that most of the mutations of the 
two tryptophan and serine resulted in molecules that did not reach the cell surface but 
were retained in the endoplasmatic reticulum (Hilton et al., Proc. Natl. Acad. Sci. USA 
92:190, 1995; Hilton et al., J. Biol. Chem. 271:4699, 1996). Furthermore, an Ala" 1 to 
Glu mutation in the WSAWS sequence resulted in better transportation from the ER to 
the Golgi and a 3-5 fold increase of the number of EPOR molecules on the cell surface 
compared to the wild-type (Hilton et at, Proc. Natl. Acad. Sci. USA 92:190, 1995; HiKon 
et al., J. BioL Chem. 271:4699, 1996). These results support our conclusion that the 
WSXWS sequence plays an important role in the structure and folding of D2 in EPOR 
and other related receptors. 

Comparison with other cytokine-receptor complex structures 
The overall quaternary structure of the peptide-EBP complex substantially from the 
equivalent arrangement in the hGH-hGHR complex. The non-symmetric nature of the 
angle four-helix-bundle structure of the growth hormone ligand results in an asymmetric 
homo-dimerization of the receptor that corresponds to a 159° rotation between receptors 
compared to the almost perfect 2-fold (180°) rotation for the EBP-peptide complex. The 
tertiary arrangement of domains within EBP and hGHbp is also somewhat different. 
When the equivalent EBP and hGHbp D2 domains are superimposed on each other, their 
corresponding Dl domains differ by a 12° rotation and a 4.3A translation. 
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The mechanism of hGH binding to its receptor has been well studied (Wells, Curr. Opin. 
Cell Biol. 6:163, 1994; Clackson et al., Science 267:383, 1995) and is sequential. Initial 
high affinity (nM) binding of the hormone with one receptor results is a buried surface 
of 1 130 A 2 on the receptor. The second hGHbp2 has a substantially smaller interface 
(deVos et al., Science 255:306, 1992) with the second binding site on hGH and interacts 
only with the preformed 1 : 1 complex to generate buried surface areas of 740 A 2 with hGH 
and 440 A 2 with the first hGHbpl (deVos et al., Science 255:306, 1992; (Wells, Curr. 
Opin. Cell Biol. 6:163, 1994; Clackson et al., Science 267:383, 1995). The binding 
determinants of each hGHbp are comprised of the six recognition loops (L1-L6), three 
of which (L1-L3) come from one end of the p-sandwich structure in Dl, one from the 
interdomain linker and two from D2. 

Although these two receptor complexes, EBP-EMP1 and hGH-hGHbp, have different 
dimeric arrangements, which probably in this case represent differences in the size and 
shape of the natural versus synthetic ligand, both receptors share equivalent ligand 
recognition loops, LI, L3 f L5 and L6 for the EBP and LI to L6 for the hGHbp. A non- 
active PRLR, complexed with only one molecule of hGH, also uses the same contact 
loops (LI to L6) (Somers et al., Nature 372:478, 1994). Based on similarity of the ligand 
recognition sites in hGHbp and PRLR, one would expect that the binding site of EBP, 
when its natural EPO iigand is bound, would extend to include two additional loops, L2 
and IA, that comprise residues 59-63 (L2) between strands C to C, and residues 110-118 
(L4) from the carboxyl end of PG in Dl and the interdomain linker. These six loops in 
EBP, hGHbp and PRLR area in structurally equivalent positions but vary in size, amino 
acid composition and conformation although the interacting portions of each loop (side 
or tip) remain similar, LI, L2, L3, L5 interact mainly with their tips and L6 with its side. 
In EBP, the L5 loop is three residues shorter than in hGHbp and PRLR, where the L6 
loop is three and four residues longer than in hGHbp and PRLR, respectively. The L2 
loop also varies (6 to 10 residues) among the three receptors but in EBP does not 
participate in peptide binding, and in hGHbp is partially disordered, although it does 
contact the hormone. In one respect, this situation is similar to the complementarity- 
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determining regions (CDR's) in antibodies, where changes in length and sequence of the 
six binding loops impose specificity for different antigens, whereas the framework itself 
remains constant (Wilson et al., Ciba Foundation Symposium. Wiley, Chichester, 1991, 
Vol. 159, p. 13). 

It has been shown for the hGH-hGHbp complex that only a subset of 9 out of 33 
interacting residues that make up the structural epitope of the receptor constitute a 
functional epitope or hot spot (Wells, Curr. Opin. Cell Biol. 6: 163, 1994; Clackson et al., 
Science 267:383, 1995) where high affinity binding interaction takes place. This reduced 
epitope is substantially smaller than the structural epitope and is comprised from residues 
(Aig*. Glu 44 , De ,<B , Trp ,M . De 105 , Pro' 06 , Asp ,6J , and Trp 169 ) which are located in contact 
loops LI, 13 and L5 with the most significant contribution (>4.5 kcal/mol) coming from 
two aromatic residues (Trp 104 and Trp" 9 ) in L3 and L5 (Wells, Curr. Opin. Cell Biol. 
6:163, 1994; Clackson et al., Science 267:383, 1995; Wells, Proc. Natl Acad. Sci. USA 
93:1, 1996). In EBP, Phe" is equivalent to Trp 104 in hGHbp, as suggested previously 
(Wells, Curr. Opin. Cell Biol. 6:163, 1994; Clackson et al., Science 267:383, 1995; Wells, 
Proc. Natl Acad. Sci. USA 93:1, 1996; Jolliffe et al., Nephrol. Dial. Trans. 10:suppl. 2, 
28, 1995), but there is no homologous residue to Trp 1 * 9 in the shorter L5 loop. In the 
EBP-EMP1 complex, the Phe M peptide aromatic side chain occupies the equivalent 
position of the Trp ,w side chain in hGHbp. One can assume that when EPO binds to its 
receptor, the hormone may provide an aromatic residue to the hydrophobic core of the 
binding interface and/or the L6 loop in EBP may play a more significant role in the 
hormone binding than in hGHbp, since it is 3 residues longer and contains the aromatic 
Phe" 5 . 

In these three class- 1 receptor structures, some loops are disordered which are in D2 for 
EBP for EBP (residues 164-166 in EBP1 and 133-135 in EBP2) and in Dl for both 
hGHbp (residues 55-58, 73-78 for hGHbpl and 54-60, 73-75 for hGHbp2) and PRLR 
(residues 31-33, 84-86). Otherwise, these three class- 1 cytokine receptors do not differ 
greatly in their over all tertiary structures; Dl and D2 have broadly similar general 
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arrangement in all three receptors such that the angle between the long axes of the two 
domains is approximately 90 degrees. I tis this arrangement of domains that allow these 
particular L1-L6 loops to be available for the recognition and binding of ligands. In a 2:2 
complex between IFN-y and its dass-2 receptor IFN-yRa, Dl and D2 are related by a 

s 125 degree angle, which elongates the receptor and restricts the binding determinants that 
can be used for interaction with hormone; the LI loop now becomes buried in the D1-D2 
interface, although the other five loops (L2-L6) are still available for ligand interaction. 
This elongated interdomain arrangement is also observed in tissue factor (Muller et al., 
Biochemistry 33:10864, 1994; Harios et al., Nature 370:662, 1994) which has a distant 

io relationship to the cytokine receptor superfamily. 

A mutational analysis of the EBP molecule indicates that the most crucial amino acid 
residue for binding EPO is Phe" in the L3 loop (Jolliffe et al., Nephrol. Dial. Trans. 
10:suppl 2,28, 1995). The Phe93Ala mutant shows an increase int he IC* compared to 

15 the wild-type by a factor of approximately 1000, whereas other mutants (Ser91Ala, 
Ser92Ala, Val94Ala, Metl50Ala and Hisl53Ala) show small relative increases in teh 
ICjo of only 2.5-12.5 fold). The side chain of Phe M buries 66 A 2 of molecular surface, 
which is the highest among interacting side chains. In hGHbp, the corresponding 
Tipl04Ala mutation results in an increase in the by a factor of more than 2,500 

» compared to the wiia-iype indicating xne equivalent uiipunoin* «i «v».w-». ... 

binding and its key contribution to the hydrophobic core of the functional epitope (Wells, 
Curr. Opin. Cell Biol. 6:163, 1994; Clackson et al., Science 267:383, 1995; Bass et al. 
Proc. Natl. Acad. Sci. USA 88:4498, 1991). 

The role of dimerization on signal transduction 
25 In the EBP-EMP1 complex structure, we surprisingly observe that a peptide, unrelated 
in sequence and probably in structure, to the natural ligand, can induce a biologically 
active dimerization of EPO receptor that promotes signal transduction and cell 
proliferation. Comparison of three dass-1 cytokine receptor complexes, whose structures 
have been determined so far, suggests that when the natural EPO hormone, which is 
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proposed to have a structure of a four-helix bundle (Boissel et al., J. Biol. Chem. 
268:15983, 1993), induces receptor dimerization, it is more likely to resemble the hGH- 
hGHbp assemblage. This would suggest that more than one mode of productive 
extracellular dimerization is permissive for intracellular dimerization of the cytoplasmic 
domains with two JAK2 molecules in order to initialize the cascade of events that 
produces the biologically relevant signal (Dile et al., Seminars in Immunology 5:375, 
1993; Klingmuller et al., Cell 80:729, 1995). The peptide-EBP structure would then 
represent only one possible dimeric arrangement that promotes signal transduction. 

Mutant EPOR molecules, containing a single Arg to Cys mutation (Arg 130 in human and 
Arg 129 in murine), have been shown to form biologically active dimers in the absence of 
EPO (Yoshimura et al., J. Biol. Chem. 267:11619, 1992); Watowich et al., Proc. Natl. 
Acad. Sci. USA 89:2140, 1992; Watowich et al., Mol. Cell. Biol. 14:3535, 1994), 
suggesting that extracellular recptor homo-dimerization may be sufficient in itself for 
signal transduction It has been shown in another system (Spencer et al.. Science 
262: 1019, 1993) that activation of a specific set of trasncription factors can be induced 
by the chemical crosslinking of cytoplasmic domains of modified cell membrane 
receptors that do not contain the extracellular and transmembrane domains. These 
receptors are not related to the cytokine receptor superfamily but illustrate that 
oiigomerication piays a key role in activation of the receptor, and that the main functional 
role of the extracellular, Ugand-binding domain is to allow (in the presence of ligand) 
dimerization or oligomerization and induce similar association of the cytoplasmic 
domains. 

Mutageneses experiments originally suggested a role for the WSXWS motif in this cell 
signalling process (Y oshimura et al., J. Biol. Chem 267: 1 1619, 1992; QueBe et al. Mol. 
Cell. Biol. 12:4553, 1992; Chiba et al., Biochem. Biophys. Res. Comm. 184:485, 1992) 
possibly by promoting receptor homo-dimerization. However, truncation mutants of 
EPOR (Miura et al., Arch. Biochem. Biophys. 306:200, 1993) do not confirm this role for 
the WSXWS motif. The EBP-EMP1 complex structure shows that the WSXWS motif 
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of the EPOR, as for the hGH-hGHbp complex (deVos et al., Science 255:306, 1992) is 
located on the opposite face of the molecule from the receptor dimerization. In the 
absence of unliganded structures for the extracellular domains of EPOR, hGHR and 
PRLR, it is not possible to determine whether any conformation change occurs on ligand 
5 binding that would involve the WSXWS box. Apart from bring a striking structural 
feature in D2, and its obvious proximity to the membrane spanning domain, one cannot 
rule out posable interactions of this region with some other cell surface molecules that 
are involved somehow in the signal transduction process. 

Towards design of small molecule mimetics 

10 The structure of the EMP1 dimer demonstrates that a peptide considerably smaller than 
the natural hormone can act as an agonist and induce the appropriate biological response. 
The peptide can be assumed to form a substantially smaller contact interface than the 
natural hormone with the receptor. The peptide binding site in EBP forms an almost flat 
surface, which is mainly hydrophobic in nature, without any cavities or charged residues 

15 that are normally essential for the specific targeting of small molecule ligands to a 
receptor binding site. The hGHbp study (Wells et al. t Science 267:383, 1995; Wells, 
Proc. Natl. Acad. Sci. USA 93:1, 1996) shows that only a small part of the observed 
structural binding site, the so-called functional epitope (sUEJS), contributes most of the 
binding energy and strongly implied that a "minimized" hormone designed to interact 

20 with this ate could form sufficient interactions to activate the receptor. Furthermore, the 
limited site of interaction of the small agonist peptide with the EBP corresponds almost 
exactly to the smaller functional epitope derived from alanine scanning of hGH and 
hGHbp. Thus, by a different approach, we have arrived at the similar conclusion that a 
small number of key interactions can contribute to a functional epitope on a receptor. 

25 Understanding of this simplified interaction surface can be now combined with further 
mutational studies to assist in identifying the most crucial residues in the functional 
epitope, and consequently provide a more practical target for drug design. 

Data Collection, MIR and Refinement Statistics 
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Thc crystallographic data is summarized in Table 1. Native crystallographic data were 
collected on a Siemens multiwire area detector mounted on an Elliott GX-18 generator, 
operating at 40kV and 55mA, with a crystal-to-detector distance of 120mm. Two 
derivative data sets were collected on a MAR image plate mounted on a Siemens 
generator operating at 50kV and 80mA, with crystal-to-image plate distance of 150mm. 
Data were integrated, scaled and reduced using the programs XENGEN (Howard et al., 
J. App. Cryst. 20:383, 1987) for the native data and DENZO/SCALEPACK (Otwinowski 
et al., SERC Darsbury Laboratory, Warrington, 1993) for the derivative data. Initial 
multiple isomorphous replacement anomalous scattering (MIRAS) phases were calculated 
to 3.1 A using the program package PHASES (Furey, American Crystallographic 
Association Fortieth Anniversary Meeting, New Orleans, LA, 1990) with a mean figure 
of merit of 0.64 (25.0-3.1 A). Phases were refined in PHASES using the solvent 
fattening protocol to a mean figure of merit of 0.92 (25.0-3. 1 A). The quality of the map 
was generally good and most of the complex structure (94%) could be fitted using the 
graphics program O (Jone et al., Acta Crystallogr A47:l 10, 1991). The register of the 
amino acid residues was verified from the positions of the two disulfide bridges in Dl, 
and the positions of the two Hg's from the mercury acetate derivative that were correctly 
assumed to bind to the free Cys'" residue, the peptide interpretation was verified from 
another data set from a complex between EBP and an iodinated peptide (Tyr w was 
substituted for/Modo-Phe), which diffracted to 3.3A resolution, that in difference Fourier 
(F jodo -F 1- )a MnMS gave a dear indication of the location of the iodine atoms. The structure 
was refined using the slow-cooling protocol in X-PLOR 3.1(Brunger et al., Acta 
Crystallogr A46:585, 1990; Brunger, X-PLOR, Version 3.1: A System for X-ray and 
NMR, Yale Univ. Press, New Haven, CT, 1992) and rebuilt using Fo-Fc, 3Fo-2Fc and 
SIGMAA(Read, Acta Crystallogr. A42:140, 1986) weighted electron density maps. After 
every two cycles of refinement, a set of simulated annealing omit maps (7- 1 0%) to reduce 
model bias was calculated and the entire structure rebuilt. After several cycles of 
refinement, individual temperature factors were calculated and after 10 cycles of 
refinement and model building, the R-value was 0.21 for 8.0-2.8 A data with F>lo 
(13,984 reflections). The average thermal parameters for receptor L receptor II and the 
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peptides are 10.5A 2 , 12.3A and 10.7A respectively. Only one non-glycine residue 
[Ami 164 in EBP2], located in a loop region in Dl, is in a disallowed region in the 
Ramachandran plot. No solvent molecules were included in the model due to the 
moderate resolution (2.8 A) of the structure determination. 

5 Binding Contacts 

Binding contacts are summarized, in part, in Table 2: Hydrogen bond interactions in the 
binding site of the EBP-EMP1 complex. Due to the symmetrical nature of the complex, 
peptide- 1 and peptide-2 have equivalent interactions with the two EBP molecules. The 
hydrogen bond interactions were analyzed using HBPLUS (McDonald et al., J. Mol. Biol. 

io 238:777, 1994), based upon both distance (3.9 A cutoff) and geometrical considerations. 

A number of embodiments of the present invention have been described. Nevertheless, 
it will be understood that various modifications may be made without departing from the 
spirit and scope of the invention. Accordingly, it is to be understood that the invention 
is not to be limited by the specific illustrated embodiment, but only by the scope of the 
is appended claims. 
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WARNING RESIDUES 21-23 (521-522) and 164-166. 633-6:* 
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ATOM 305 0 GLY 49 67.294 42.396 19.152 1.00 3.06 = 

ATOM 306 N PRO 50 67. 274 41.234 19.732 1.00 4.49 " 

ATOM 307 CD PRO 50 67.104 39.790 19.536 1.00 2.00 0 

ATOM 30B CA PRO 5C 67.519 41.540 21.125 1.00 2.99 * 

ATOM 309 CB PRO 50 67.545 40.158 21.762 1 . 00 3.97 5 

ATOM 310 CG PRO 50 67.935 39.264 20.632 1.00 2.00 5 

ATOM 311 C PRO 50 68.837 42.307 21.315 1.00 4.67 

ATOM 312 O PRO 5CT 68.962 43.122 22.217 1.00 7.36 5 

ATOM 313 N GLY 51 69.787 42.100 20.403 1.00 5.60 " 

ATOM 314 CA GLY 51 71.059 42.792 20.477 1.00 2.83 * 

ATOM 315 C GLY 51 70.991 44.263 20.103 1.00 3.36 ? 

ATOM 316 O GLY 51 72.025 44.910 19.956 1.00 5.85 r 

ATOM 317 N ASK 52 69.792 44.815 19.968 1.00 2.00 ' 

ATOM 318 CA ASN 52 69.670 46.211 19.604 1.00 2.00 ~> 

ATOM 219 CB ASN 52 68.527 46.401 18.633 1.00 3.33 i 

ATOM 320 CG ASN 52 68.768 47.534 11.666 1.00 7.97 6 

ATOM 321 OD1 ASN 52 69.233 47.311 16.538 1.00 12.55 s 

ATOM 322 ND2 ASN 52 68.442 48.753 18.079 1.00 10.05 7 

ATOM 323 C ASN 52 69.505 47.144 20.797 1.00 3.59 6 

ATOM 324 O ASN 52 69.524 48.359 20.638 1.00 2.00 3 

ATOM 325 N TYR 53 69.286 46.573 21.985 1.00 5.70 " 

ATOM 326 CA TYR 53 69.148 47.350 23.229 1.00 5.58 * 

ATOM 327 CB TYR 53 67.707 47.412 23.770 1.00 5.70 6 

ATOM 328 CG TYR 53 66.615 47.547 22.768 1.00 4.84 i 

ATOM 329 CD1 TYR 53 66.067 46.414 22.188 1.00 6.02 * 

ATOM 220 CE1 TYR 53 65.080 46.495 21.282 1.00 5.48 - 

ATOM 331 CO 2 TYR 52 66.123 4e.789 22.405 1.00 5.30 ? 

ATOM 232 CE2 TYR 53 65.118 48.891 21.495 1.00 6.56 * 

ATOM 233 CI TYR 53 64.598 47.729 20.929 1.00 8.17 6 

ATOM 334 OH TYR 53 63.590 47.767 19.987 1,00 12.54 5 

ATOM 335 C TYR 53 69.943 46.667 24.308 1.00 3.83 i 

ATOM 336 O TYR 53 70.176 45.465 24.245 1.00 4.09 c 

ATOM 337 N SEP. 54 70.257 47.428 25.346 1.00 3.81 

ATOM 338 CA SEP. 54 70.977 46.921 26.501 1.00 6.59 

ATOM 339 CB SER 54 72.296 47.682 26.710 1.00 8.72 

ATOM 340 OG SEP. 54 73.405 46.867 26.341 1. 00-13.32 

ATOM 341 C SER 54 70.077 47.035 27.726 1.00 4.52 

ATOM 342 0 SER 54 69.495 48.089 27.982 1.00 5.58 

ATOM 343 N PHE 55 69.931 45.926 28.443 1.00 3.52 

ATOM 344 CA PHE 55 69.095 45.876 29.641 1.00 3.38 

ATOM 345 CB PHE 55 68.141 44.676 29.574 1.00 6.11 

ATOM 346 CG PHE 55 67.052 44.700 30.596 1.00 6.96 

ATOM 347 CD1 PHE 55 65.741 44.865 20.214 1.00 10.17 

ATOM 348 C02 PHE 55 67.330 44.547 31.929 1.00 8.73 

ATOM 349 CE1 PHE 55 64.713 44.879 21.154 : . 00 12.64 

ATOM 250 CE2 PHE 55 66.317 44.561 32.874 1.00 8.02 

ATOM 251 C2 PHE 55 65.007 44.728 32.490 1.00 8.93 

ATOM 352 C PHE 55 69.942 45.779 30.891 1.00 2.90 

ATOM 253 O PHE 55 70.476 44.710 21.200 1.00 2.00 

ATOM 354 N SER 56 70.083 46.892 21.595 1.00 2.00 

ATOM 255 CA SER 56 70.854 46.862 22.818 1.00 5.40 

ATOM 356 CB SER 56 72.159 47.651 22.681 1.00 6.39 

ATOM 357 OG SER 56 71.950 48.939 22.162 1.00 9.05 

ATOM 258 C SER 56 70.089 47.214 24.108 1.00 8.43 

ATOM 359 O SER 56 69.0B0 47.943 34.098 1.00 7.21 

ATOM 360 N TYR 57 70.546 46.635 25.213 1.00 7.52 

ATOM 361 CA TYR 57 69.915 46.856 36.495 1.00 8.61 

ATOM 362 ' CB TYR 57 69.091 45.621 26.883 1.00 3.65 

ATOM 363 CG TYR 57 69.863 44.324 27.004 1.00 4. 17 

ATOM 364 CD1 TYR 57 70.254 43.825 28.254 1.00 2.00 

ATOM 265 CE1 TYR 5^ 70.865 42.587 28.384 1.0C 2.00 

ATOM 366 CD2 TYR 57 70.115 43.560 25.894 1.00 4.11 

ATOM 267 CE2 TYR 5' 70.729 42.316 26.020 1.00 2.48 

ATOM *68 CZ TYR 57 71.096 41.834 37.260 1.00 2.00 

ATOM 359 OK TY?. «7 71.672 40.591 27.334 1.00 2.00 

ATOM 270 C TYR l m 70.885 47.252 27.616 1.00 10.04 

ATOM 271 '*> TYR 57 72.092 47.181 37.466 1.00 11.97 

ATOM 372 N GLN 11 70.353 47.781 36.705 i.OC 11.03 

ATOM 373 CA GLN 58 71.181 48.142 39.832 1.00 9.11 

ATOM 374 CB GLN 5c- 71 .613 49.592 39.754 1.00 10.64 

ATOM 375 CG GLN 56 72.167 50.091 41.062 1.00 13.28 

ATOM 376 CD GLN 5£ 72.827 51.436 40.954 1.00 13.57 

ATOM 277 OE1 GLN 56 72.165 52.472 40.844 l.CC 15.62 

ATOM 378 NE2 GLN 56 74.147 51.425 41.026 1.00 14.36 

ATOM 279 C CLN 56 70.430 47.886 41.123 I . 00 6.05 

ATOM 3B0 O GLN 56 69.489 48.582 41.468 :.0C 7.60 

ATOM 281 N LEU 59 70.777 46.796 41.769 l.CC 8.41 

ATOM 282 CA LEU 59 7&.183 46.435 42.036 l.CC 6.43 
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50.668 
51.105 
49.480 
48.638 
47.527 
48.130 
46.623 
45.471 
47.931 
48.234 
47.000 
46.187 
46.758 
47.a2*» 
47.787 
49.13C 
49.850 
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26.557 
28.480 
27.476 
26.511 
25.461 
20.774 
30.794 
21.835 
23.174 
23.296 
34.613 
25.046 
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26.662 
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24.214 
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25.742 
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25.783 
26.730 
37.840 
26.55C 
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29.510 
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40.601 
42.094 
40.093 
40.542 
40.581 
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41.686 
40.688 
29.426 
40.315 
29.468 
43.001 
43.572 
43.463 
44.645 
45.860 
46.555 
41.652 
46.106 
46.891 
47.938 
44.330 
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:.00 10.61 
:.00 12.42 
;.00 12.48 
1.00 14.44 
1.00 8.52 
1.00 9.59 
1.00 10.99 
1.00 14.45 
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..00 
..00 
..00 

:.oo 
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8.05 
8.40 
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8.80 
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5.20 

6.80 

5.60 

6.45 

7.39 
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7.16 

9.02 

2.78 

2.00 
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4.22 
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2.34 

3.21 

2.00 
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6 
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60.261 
61.439 
^60.953 
62.333 
63.731 
59.10" 
58.294 
59.012 
57.936 
58.317 
58.382 
59.470 
57.806 
56.589 
55.512 
56.664 
55.480 
53.732 
55.958 
56.160 
56.648 
55.816 
5!.. 116 
54.440 
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42.190 
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42.492 
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42.342 

43.822 

44.421 

44.414 

42.722 

42.223 

43.852 

44.614 

46.113 

46.606 

46.090 

48.601 

46.751 

44.321 

45. 12? 



43.871 
43.883 
43.984 
42.650 
42.966 
44.7*1 
44.3C2 
45.98S 
46.895 
43.37C 

48.07C 
49.33: 
46.54" 
46.879 
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4!i.436 
45.534 
46.93" 
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44.025 
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1.00 
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:.oo 
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:.oo 
..cc 
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..oo 
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:.oc 
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3.64 
4.23 
2.00 
2.23 
3.56 
6.91 
4.63 
5.21 
5.33 
5.r4 
3.06 
8.46 
S.r2 
4.52 
8.36 
7.65 
6.11 
12.42 
15.01 
DO 16.35 
00 11.71 
*.25 
*.90 
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929 CB ALA 



930 
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ALA 
ARG 



941 
942 
943 
944 



949 
950 
951 



932 N 

933 CA ARG 

934 CB ARG 

935 CG ARG 

936 CD ARG 
931 NE ARC 
936 C2 ARG 

939 NH1 ARG 

940 NH2 ARG 
C ARG 
0 ARG 
N LEU 
CA LEU 

945 CB LEU 

946 CG LEU 

947 CD1 LEU 

948 CD 2 LEU 
C LEU 
0 
N 

952 CA ALA 

953 CB ALA 
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955 0 

956 N 

957 CA ASP 
956 CB ASP 

959 CG ASP 

960 OD1 ASP 

961 OD2 ASP 
C ASP 
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N CLU 

965 CA GLU 

966 CB GLU 
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968 CD GLU 
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994 CA VAL 
95 CB VAL 
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130 
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128 
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128 
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128 
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32.693 
32.730 
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31.288 
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32.108 
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34.840 
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29.932 
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27.813 
26.640 
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25.035 
24.252 
27.799 
27.702 
27.968 
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26.638 
25.719 
26.553 
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25.868 
26.187 
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26.894 
24.655 
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25.310 
24.742 
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26.589 
28.003 
28.980 
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24.784 
25.842 
23.646 
23.375 
24.435 
25.623 
22.990 
22.366 
23.287 
22.945 
23.765 
23.225 
25.071 
25.992 
■26.594 
25.768 
26.049 
24.484 
24.014 
24.944 
27.081 
27.426 
27.547 
28.598 
28.391 
29.725 
27.444 
29.691 
29.960 
30.872 
32.194 
32.535 
33.781 
31.415 
33.249 
33.33: 



21.680 
19.876 
18.760 
20.106 
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18.310 
16.979 
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17.763 
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21.83? 
21.981 
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22.301 
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23.862 
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4.92 
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4.60 • 
3.40 " 
9.26 • 
13.86 
10.41 
10.20 
10.37 
11.25 
14.78 
9.47 
7.74 
9.05 
4.98 
2.00 
2.23 
2.00 
2.0C 
2.00 
4.55 
2.47 
2.13 
8.93 
6.63 
11.16 
11.60 
12.78 
13.18 
12.67 
14.82 
12.59 
12.29 
16.27 
23.12 
27.82 
29.07 
27.91 
11.37 
11. £6 
11.02 
12.53 
11.49 
9.49 
12.12 
3.78 
12.44 
7.47 
7.05 
8.24 
9.46 
6.68 
9.21 
10.20 
3.47 
9.61 
13. :2 
13.22 
10.91 
9.60 
10.32 
11.10 
3.62 



6.52 



..30 3.18 
1.30 11.91 
1.90 12.22 
1.30 14.15 
L.00 13.52 
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ATOM 1709 CB LEU 311 
ATOM 1710 CG LEU 311 
ATOM HI! CZ1 LEU 311 
ATOM 1712 C22 LEU 31i 
ATOM H13 C LEU 311 
ATOM 1714 C LEU 311 
ATOM 1715 M THP. 312 
ATOM 1716 CA THR 212 
ATOM 1717 CB TUP. 312 
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ATOM 1723 CA TRP 213 
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ATOM 1725 CG TRP 313 
ATOM 1726 C32 TRP 313 
ATOM 1727 CI 2 TRP 313 
ATOM 1728 CI3 TRP 313 
ATOM 1729 CD1 TRP 313 
ATOM 1730 NZ1 TRP 313 
ATOM 1731 CZ2 TRP 313 
ATOM 1732 CZ3 TRP 313 
ATOM 1*723 CH2 TRP 313 
ATOM 1734 C TRP 313 
ATOM 1735 1 TRP 313 
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ATOM 1745 C CYS 315 
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ATOM 1747 CB CYS 315 
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ATOM 1749 M LYS 316 
ATOM 1750 CA LYS 316 
ATOM 1751 C3 LYS 316 
ATOM 1752 CG LYS 316 
ATOM 1753 CD LYS 316 
ATOM 1754 CE LYS 316 
ATOM 1755 SZ LYS 216 
ATOM 1756 Z LYS 316 
ATOM 1757 : LYS 316 
ATOM 1758 K PRO 217 
ATOM 1759 CD PRO 21^ 
ATOM 1760 CA PRO 317 
ATOM 1761 CB PP.0 31 1 
ATOM 1762 CG PRO 217 
ATOM 1763 C PRO 317 
ATOM 1764 : PRO 217 
ATOM 1765 GLK 318 

ATOM 1766 ' CA GLN 318 
ATOM 1767 CB GLN 318 
ATOM I76e CG GLK 318 
ATOM 1769 CD GLN 3ie 
ATOM 17 TO C-El GLK 218 
ATOM 1771 SE2 GLK 318 
ATOM 1772 C GLN 318 
ATOM 1772 I GLN 21? 
ATOM 1774 THR 403 

ATOM 1775 CA THR 403 
ATOM 1776 Cr TK? 403 
ATOM 1777 OG1 TK? 402 
ATOM 1773 CG2 TKR 402 
ATOM 1779 C THR 402 
ATOM 1780 : THR fl02 
ATOM 1781 :: TY? 404 
ATOM 1782 CA TYP. 404 
ATOM 1782 C5 TYR 404 
ATOM 1784 CG TYR 404 
ATOM 1785 CC1 TYR 404 
ATOM 17B6 CE1 TYR 404 



53.834 28.437 17.080 1.00 4 . 3*4 
52.837 27.277 57.127 1.00 5 . *4 
51.524 27.802 57.619 1.00 5.26 
52.684 26.580 i5.78l ; . 00 1.22 
54.346 30.580 56.002 1.00 4.35 
53.586 31.411 55.540 1.00 4.76 
55.390 30.898 16.741 1.00 5.21 
55.680 32.285 57.075 1.00 7 . 62 
54.772 32.732 58.256 1.00 7.5? 
54.898 34.144 58.476 i.00 1C.56 
55.115 31.951 59.526 I. 00 6.26 
57.168 32.342 57.444 i.00 6.58 
57.868 31.352 57.215 1.00 7.15 
57.641 33.477 57.979 1-00 5.96 
59.050 33.653 13.382 1.00 6.02 
59.294 35.032 58.992 1.00 2.13 
58.937 36.213 58.158 1.00 5.28 
59.685 36.779 57.066 1.00 6.24 
59.026 37.968 56.682 1.00 4.28 
60.845 36.403 56.382 1.00 3.66 
57.883 37.044 58.359 1.00 4.69 
57.933 38.103 57.488 1.00 5. CO 
59.484 38.784 55.656 1.00 4.75 
61.301 27.222 55.350 1.00 6.62 
60.618 38.398 55.004 1.00 5.72 
59.605 32.619 59.389 :.00 3.28 
58.931 32.186 '.0.359 l.OC =r.40 
60.847 32.216 59.139 1.00 ~.4C 
61.533 31.286 60.013 i.00 £.£7 
62.145 20.110 59.249 1.00 4.32 
62.868 29.204 60.190 1.00 5.24 
61.099 29.333 58.536 1.00 3.95 
62.637 32.192 60.499 I. 00 8.62 
63.512 32.562 59.730 1.00 11. C3 
62.516 32.672 61.724 1.00 9.61 
63.524 33.567 62.250 1.00* 10.57 
64.417 32.948 63.293 1.00 10.10 
64.191 31.825 63.748 1.00 11.89 
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65.698 39. Hi 55.366 l.OC 2.26 
65.223 38. 96* 52.910 1 , CO J. 42 
65.036 37.70-: 52.377 ..DC 2.72 
64.723 37.54: 5;?. 042 l.OC 2.01' 
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CLAIMS 

What is claimed is: 

1 A computer-assisted method for identifying potential mimetics of erythropoietin, 
using a programmed computer comprising a processor, a data storage system, an 
input device, and an output device, comprising the steps of: 

(a) inputting into the programmed computer through said input device data 
comprising the three-dimensional coordinates of a subset of the atoms in 
the peptide GGTYSCHFGPLTWVCKPQGG when said peptide is co- 
crystallized with a portion of the erythropoietin receptor comprising 
amino acids 1 to 225 of said receptor, thereby generating a criteria data 
set; 

(b) comparing, using said processor, said criteria data set to a computer 
database of chemical structures stored in said computer data storage 
system; 

(c) selecting from said database, using computer methods, chemical structures 
having a portion that is structurally similar to said criteria data set; 

(d) outputting to said output device the selected chemical structures having 
a portion similar to said criteria data set. 
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A computer-assisted method for identifying potential mimetics of erythropoietin, 
using a programmed computer comprising a processor, a data storage system, an 
input device, and an output device, comprising the steps of: 

(a) inputting into the programmed computer through said input device data 
comprising the three-dimensional coordinates of a subset of the atoms in 
the peptide GGTYSCHFGPLTWVCKPQGG when said peptide is co- 
crystallized with a portion of the erythropoietin receptor comprising 
amino acids 1 to 225 of said receptor, thereby generating a criteria data 
set; 

(b) constructing, using computer methods, a model of a chemical structure 
having a portion that is structurally similar to said criteria data set; 

(c) outputting to said output device the constructed model. 

A compound having a chemical structure selected using the method of claim 1, 
said compound bong an EPO mimetic. 

The compound of claim 3 wherein said compound is not a peptide. 
The compound of claim 3 wherein said compound is a peptide. 



6. 



The compound of claims 5 wherein said peptide has 15 of fewer amino acids. 
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